7. Sequence Listing 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT; Hook, Magnus 

Patti, Joseph M. 
House -Pompeo, Karen 
S thanam , Nar ayana 
Symersky, Jindrich 

(ii) TITLE OF INVENTION: COLLAGEN BINDING PROTEIN COMPOSITIONS 
AND METHODS OF USE 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arnold, White & Durkee 

(B) STREET: P.O. Box 4433 

(C) CITY: Houston 

(D) STATE: Texas 

(E) COUNTRY: U.S. 
<F) ZIP: 77210 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US Unknown 

<B) FILING DATE: Concurrently Herewith 

(C) CLASSIFICATION: Unknown 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/017,678 

(B) FILING DATE: 16-MAY-1996 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Kitchell, Barbara S. 

(B) REGISTRATION NUMBER: 33,928 

(C) REFERENCE/DOCKET NUMBER: TAMK:193 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (512) 418-3000 

(B) TELEFAX: (512) 474-7577 



(2) INFORMATION FOR SEQ ID NO:l: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATAACATCTG GGAATAAATC AACGAATGTT ACGGTTCATA AAAGTGAAGC GGGAACAAGT €0 

AGTGTTTTCT ATTATAAAAC GGGAGATATG CTACCAGAAG ATACGACACA TGTACGATGG 12 0 

TTTTTAAATA TTAACAATGA AAAAAGTTAT GTATCGAAAG ATATTACTAT AAAGGATCAG 180 

ATTCAAGGTG GACAGCAGTT AGATTTAAGC ACATTAAACA TTAATGTGAC AGGTACACAT 24 0 

AGCAATTATT ATAGTGGACA AAGTGCAATT ACTGATTTTG AAAAAGCCTT TCCAGGTTCT 3 00 

AAAATAACTG TTGATAATAC GAAGAACACA ATTGATGTAA CAATTCCACA AGGCTATGGG 3 60 

TCATATAATA GTTTTTCAAT TAACTACAAA ACCAAAATTA CGAATGAACA GCAAAAAGAG 42 0 

TTTGTTAATA ATTCACAAGC T 441 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Arg Gly Ser His His His His His His Gly Ser He Thr Ser Gly 
15 10 15 

Asn Lys Ser Thr Asn Val Thr Val His Lys Ser Glu Ala Gly Thr Ser 



20 



25 



30 



Ser 



Val Phe 
35 



Tyr Tyr Lys Thr Gly Asp Met Leu Pro Glu Asp Thr Thr 
40 45 



His 



Val Arg 
50 



Trp Phe Leu Asn He Asn Asn Glu Lys Ser Tyr Val Ser 
55 60 



Lys 
65 



Asp He 



Thr He Lys Asp Gin He Gin Gly Gly Gin Gin Leu Asp 

70 75 80 



Leu 



Ser Thr 



Leu Asn He Asn Val Thr Gly Thr His Ser Asn Tyr Tyr 
85 90 95 
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Ser Gly Gin Ser Ala lie Thr Asp 
100 



Phe Glu Lys Ala Phe Pro Gly Ser 
105 110 



Lys lie Thr Val Asp Asn Thr Lys 
115 120 



Asn Thr lie Asp Val Thr lie Pro 
125 



Gin Gly Tyr Gly Ser Tyr Asn Ser 
130 135 



Phe Ser lie Asn Tyr Lys Thr Lys 
140 



lie Thr Asn Glu Gin Gin Lys Glu 
145 150 



Phe Val Asn Asn Ser Gin Ala 
155 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GACGATAAAA ATGGAAAAAT ACAAAATGGT GACATGATTA AAGTGGCATG GCCGACAAGC 60 

GGTACAGTAA AGATAGAGGG TTATAGTAAA ACAGTACCAT TAACTGTTAA AGGTGAACAG 12 0 

GTGGGTCAAG CAGTTATTAC ACCAGACGGT GCAACAATTA CATTCAATGA TAAAGTAGAA 18 0 

AAATTAAGTG ATGTTTCGGG ATTTGCAGAA TTTGAAGTAC AAGGAAGAAA TTTAACGCAA 24 0 

ACAAATACTT CAGATGACAA AGTAGCTACG ATAACATCTG GGAATAAATC AACGAATGTT 300 

ACGGTTCATA AAAGTGAAGC GGGAACAAGT AGTGTTTTCT ATTATAAAAC GGGAGATATG 360 

CTACCAGAAG ATACGACACA TGTACGATGG TTTTTAAATA TTAACAATGA AAAAAGTTAT 42 0 

GTATCGAAAG ATATTACTAT AAAGGATCAG ATTCAAGGTG GACAGCAGTT AGATTTAAGC 480 

ACATTAAACA TTAATGTGAC AGGTACACAT AGCAATTATT ATAGTGGACA AAGTGCAATT 540 

ACTGATTTTG AAAAAGCCTT TCCAGGTTCT AAAATAACTG TTGATAATAC GAAGAACACA 600 

ATTGATGTAA CAATTCCACA AGGCTATGGG TCATATAATA GTTTTTCAAT TAACTACAAA 660 

ACCAAAATTA CGAATGAACA GCAAAAAGAG TTTGTTAATA ATTCACAAGC TTGGTATCAA 720 

GAGCATGGTA AGGAAGAAGT GAACGGGAAA TCATTTAATC ATACTGTGCA CAATATTAAT 780 

GCTAATGCCG GTATTGAAGG TACTGTAAAA GGTGAATTAA AAGTTTTAAA ACAGGATAAA 840 

GATACCAAG 849 
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(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



Met Arg Gly Ser His His His His His His Gly Ser Asp Asp Lys Val 
1 5 10 15 

Ala Thr He Thr Ser Gly Asn Lys Ser Thr Asn Val Thr Val His Lys 
20 25 30 

Ser Glu Ala Gly Thr Ser Ser Val Phe Tyr Tyr Lys Thr Gly Asp Met 
35 40 45 

Leu Pro Glu Asp Thr Thr His Val Arg Trp Phe Leu Asn He Asn Asn 
50 55 60 

Glu Lys Ser Tyr Val Ser Lys Asp He Thr He Lys Asp Gin He Gin 
65 70 75 80 

Gly Gly Gin Gin Leu Asp Leu Ser Thr Leu Asn He Asn Val Thr Gly 
85 90 95 

Thr His Ser Asn Tyr Tyr Ser Gly Gin Ser Ala He Thr Asp Phe Glu 
100 105 no 

Lys Ala Phe Pro Gly Ser Lys He Thr Val Asp Asn Thr Lys Asn Thr 
115 120 125 

He Asp Val Thr He Pro Gin Gly Tyr Gly Ser Tyr Asn Ser Phe Ser 
130 135 140 

He Asn Tyr Lys Thr Lys He Thr Asn Glu Gin Gin Lys Glu Phe Val 
145 150 155 160 

Asn Asn Ser Gin Ala Trp Tyr Gin Glu His Gly Lys Glu Glu Val Asn 
165 170 175 

Gly Lys Ser Phe Asn His Thr Val His Asn He Asn Ala Asn Ala Gly 
180 ;l85 190 

He Glu Gly Thr Val Lys Gly Glu Leu Lys Val Leu Lys Gin Asp Lys 
195 200 205 



Asp Thr Lys 
210 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCACGAGATA TTTCATCAAC GAATGTTACA GATTTAACTG TATCACCGTC TAAGATAGAA 60 

GATGGTGGTA AAACGACAGT AAAAATGACG TTCGACGATA AAAATGGAAA AATACAAAAT 12 0 

GGTGACATGA TTAAAGTGGC ATGGCCGACA AGCGGTACAG TAAAGATAGA GGGTTATAGT 180 

AAAACAGTAC CATTAACTGT TAAAGGTGAA CAGGTGGGTC AAGCAGTTAT TACACCAGAC 240 

GGTGCAACAA TTACATTCAA TGATAAAGTA GAAAAATTAA GTGATGTTTC GGGATTTGCA 300 

GAATTTGAAG TACAAGGAAG AAATTTAACG CAAACAAATA CTTCAGATGA CAAAGTAGCT 3 60 

ACGATAACAT CTGGGAATAA ATCAACGAAT GTTACGGTTC ATAAAAGTGA AGCGGGAACA 420 

AGTAGTGTTT TCTATTATAA AACGGGAGAT ATGCTACCAG AAGATACGAC ACATGTACGA 4 80 

TGGTTTTTAA ATATTAACAA TGAAAAAAGT TATGTATCGA AAGATATTAC TATAAAGGAT 540 

CAGATTCAAG GTGGACAGCA GTTAGATTTA AGCACATTAA ACATTAATGT GACAGGTACA 6 00 

CATAGCAATT ATTATAGTGG ACAAAGTGCA ATTACTGATT TTGAAAAAGC CTTTCCAGGT 660 

TCTAAAATAA CTGTTGATAA TACGAAGAAC ACAATTGATG TAACAATTCC ACAAGGCTAT 720 

GGGTCATATA ATAGTTTTTC AATTAACTAC AAAACCAAAA TTACGAATGA ACAGCAAAAA 780 

GAGTTTGTTA ATAATTCACA AGCTTGGTAT CAAGAGCATG GTAAGGAAGA AGTGAACGGG 840 

AAATCATTTA ATCATACTGT GCACAATATT AATGCTAATG CCGGTATTGA AGGTACTGTA 900 

AAAGGTGAAT TAAAAGTTTT AAAACAGGAT AAAGATACCA AGGCTCCTAT AGCTAATGTA 960 

AAATTTAAAC TTTCTAAAAA AGATGGATCA GTTGTAAAGG ACAATCAAAA AGAAATTGAG 1020 

ATTATAACAG ATGCAAACGG TATTGCTAAT ATTAAAGCGT TGCCTAGTGG AGACTATATT 1080 

TTAAAAGAAA TAGAGGCGCC ACGACCGTAT ACATTTGATA AGGATAAAGA ATATCCGTTT 1140 

ACTATGAAAG ATACAGATAA TCAGGGATAT TTTACGACTA TTGAAAATGC AAAAGCGATA 1200 

GAAAAAACAA AAGATGTTTC TGCTCAAAAG GTTTGGGAAG GCACTCAAAA AGTGAAACCA 1260 
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ACGATTTATT TCAAGTTGTA CAAACAAGAT GACAATCAAA ATACAACACC AGTAGACAAA 132 0 

GC AGAGATTA AAAAATTAGA AGATGGAACG ACAAAAGTGA CATGGTCTAA TCTTCCGGAA 1380 

AATGACAAAA ATGGCAAGGC TATTAAATAT TTAGTTAAAG AAGTAAATGC TCAAGGTGAA 1440 

GATACAACAC CAGAAGGATA TACTAAAAAA GAAAATGGTT TAGTGGTTAC TAATACTGAA 1500 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Arg Gly Ser His His His His His His Gly Ser Ala Arg Asp He 
15 10 15 

Ser Ser Thr Asn Val Thr Asp Leu Thr Val Ser Pro Ser Lys He Glu 
20 25 30 

Asp Gly Gly Lys Thr Thr Val Lys Met Thr Phe Asp Asp Lys Asn Gly 
35 40 45 

Lys He Gin Asn Gly Asp Met He Lys Val Ala Trp Pro Thr Ser Gly 
50 55 60 

Thr Val Lys He Glu Gly Tyr Ser Lys Thr Val Pro Leu Thr Val Lys 
65 70 75 80 

Gly Glu Gin Val Gly Gin Ala Val He Thr Pro Asp Gly Ala Thr He 
85 90 95 

Thr Phe Asn Asp Lys Val Glu Lys Leu Ser Asp Val Ser Gly Phe Ala 
100 105 110 

Glu Phe Glu Val Gin Gly Arg Asn Leu Thr Gin Thr Asn Thr Ser Asp 
115 120 125 

Asp Lys Val Ala Thr He Thr Ser Gly Asn Lys Ser Thr Asn Val Thr 
130 135 140 

Val His Lys Ser Glu Ala Gly Thr Ser Ser Val Phe Tyr Tyr Lys Thr 
145 150 155 160 

Gly Asp Met Leu Pro Glu Asp Thr Thr His Val Arg Trp Phe Leu Asn 
165 170 175 

He Asn Asn Glu Lys Ser Tyr Val Ser Lys Asp He Thr He Lys Asp 
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180 



185 



190 



Gin He Gin Gly Gly Gin Gin Leu Asp Leu Ser Thr Leu Asn He Asn 
195 200 205 

Val Thr Gly Thr His Ser Asn Tyr Tyr Ser Gly Gin Ser Ala He Thr 
210 215 220 

Asp Phe Glu Lys Ala Phe Pro Gly Ser Lys He Thr Val Asp Asn Thr 
225 230 235 240 

Lys Asn Thr He Asp Val Thr He Pro Gin Gly Tyr Gly Ser Tyr Asn 
245 250 255 

Ser Phe Ser He Asn Tyr Lys Thr Lys He Thr Asn Glu Gin Gin Lys 
260 265 270 

Glu Phe Val Asn Asn Ser Gin Ala Trp Tyr Gin Glu His Gly Lys Glu 
275 280 285 

Glu Val Asn Gly Lys Ser Phe Asn His Thr Val His Asn He Asn Ala 
290 295 300 

Asn Ala Gly He Glu Gly Thr Val Lys Gly Glu Leu Lys Val Leu Lys 
305 310 315 320 

Gin Asp Lys Asp Thr Lys Ala Pro He Ala Asn Val Lys Phe Lys Leu 
3 25 330 335 

Ser Lys Lys Asp Gly Ser Val Val Lys Asp Asn Gin Lys Glu He Glu 
340 345 350 

He He Thr Asp Ala Asn Gly He Ala Asn He Lys Ala Leu -Pro Ser 
355 360 365 

Gly Asp Tyr He Leu Lys Glu He Glu Ala Pro Arg Pro Tyr Thr Phe 
370 375 380 

Asp Lys Asp Lys Glu Tyr Pro Phe Thr Met Lys Asp Thr Asp Asn Gin 
385 390 395 400 

Gly Tyr Phe Thr Thr He Glu Asn Ala Lys Ala He Glu Lys Thr Lys 
405 410 415 

Asp Val Ser Ala Gin Lys Val Trp Glu Gly Thr Gin Lys Val Lys Pro 
420 425 430 

Thr He Tyr Phe Lys Leu Tyr Lys Gin Asp Asp Asn Gin Asn Thr Thr 
435 440 445 

Pro Val Asp Lys Ala Glu He Lys Lys Leu Glu Asp Gly Thr Thr Lys 
450 455 460 
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Val Thr Trp Ser Asn Leu Pro Glu Asn Asp Lys Asn Gly Lys Ala lie 
465 470 475 480 

Lys Tyr Leu Val Lys Glu Val Asn Ala Gin Gly Glu Asp Thr Thr Pro 
485 490 495 

Glu Gly Tyr Thr Lys Lys Glu Asn Gly Leu Val Val Thr Asn Thr Glu 
500 505 510 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Arg Gly Ser His His His His His His Gly Ser Met Val Ala Ala 
15 10 15 

Asp Ala Pro Ala Ala Gly Thr Asp lie Thr Asn Gin Leu Thr Asn Val 
20 25 30 

Thr Val Gly lie Asp Ser Gly Thr Thr Val Tyr Pro His Gin Ala Gly 

35 40 45 

Tyr Val Lys Leu Asn Tyr Gly Phe Ser Val Pro Asn Ser Ala Val Lys 
50 55 60 

Gly Asp Thr Phe Lys lie Thr Val Pro Lys Glu Leu Asn Leu Asn Gly 
65 70 75 80 

Val Thr Ser Thr Ala Lys Val Pro Pro lie Met Ala Gly Asp Gin Val 
85 90 95 

Leu Ala Asn Gly Val lie Asp Ser Asp Gly Asn Val lie Tyr Thr Phe 
100 105 110 

Thr Asp Tyr Val Asn Thr Lys Asp Asp Val Lys Ala Thr Leu Thr Met 
115 120 125 

Pro Ala Tyr lie Asp Pro Glu Asn Val Lys Lys Thr Gly Asn Val Thr 
130 135 140 

Leu Ala Thr Gly He Gly Ser Thr Thr Ala Asn Lys Thr Val Leu Val 
145 150 155 160 

Asp Tyr Glu Lys Tyr Gly Lys Phe Tyr Asn Leu Ser He Lys Gly Thr 
165 170 175 
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He Asp Gin He 
180 

Val Asn Pro Ser 
195 

Leu Lys Pro Asn 
210 

Ser He Lys Val 
225 

Tyr Phe Val Asn 



He Thr Phe Pro 
260 



Asp Lys Thr Asn 



Gly Asp Asn Val 
200 

Thr Asp Ser Asn 
215 

Tyr Lys Val Asp 
230 

Pro Glu Asn Phe 
245 

Asn Pro Asn Gin 



Asn Thr Tyr Arg 
185 

He Ala Pro Val 



Ala Leu He Asp 
220 

Asn Ala Ala Asp 
235 

Glu Asp Val Thr 
250 

Tyr Lys Val Glu 
265 



Gin Thr He Tyr 
190 

Leu Thr Gly Asn 
205 

Gin Gin Asn Thr 



Leu Ser Glu Ser 
240 

Asn Ser Val Asn 
255 

Phe Asn Thr Pro 
270 



Asp Asp Gin He Thr 

275 

Asp Pro Asn Ser Lys 
290 

Tyr Asn Ser Asn He 
305 

Ala Phe Asn Asn Gly 
325 

Val Pro Glu Gin Pro 
340 



Thr Pro Tyr He Val Val 
280 

Gly Asp Leu Ala Leu Arg 
295 

He Trp Arg Ser Met Ser 
310 315 

Ser Gly Ser Gly Asp Gly 

330 

Asp Glu Gin Ala 
345 



Val Asn Gly His He 
285 

Ser Thr Leu Tyr Gly 
300 

Trp Asp Asn Glu Val 
320 

He Asp Lys Pro Val 
335 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

t (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Arg Gly Ser His His His His His His Gly Ser Glu Gly Gly Gin 
1 5 .10 15 

Asn Ser Gly Asn Gin Ser Phe Glu Glu Asp Thr Glu Glu Asp Lys Pro 
20 25 30 

Lys Tyr Glu Gin Gly Gly Asn He Val Asp He Asp Phe Asp Ser Val 
35 40 45 
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Pro Gin lie His Gly Gin 
50 

Thr Glu Lys Asp Lys Pro 
65 70 

lie Asp Phe Asp Ser Val 
85 

Glu He He Glu Glu Asp 
100 

Gly Gly His Asn Ser Val 
115 

Ser Gly Gin Asn Glu Phe 
130 



Asn Lys Gly Asn Gin 
55 

Lys Tyr Glu His Gly 
75 

Pro His He His Gly 
90 

Thr Asn Lys Asp Lys 
105 

Asp Phe Glu Glu Asp 
120 

Asp He Lys Leu Asn 
135 



Ser Phe Glu Glu Asp 

60 

Gly Asn lie lie Asp 
80 

Phe Asn Lys His Thr 
55 

Pro Ser Tyr Gin Phe 
110 

Thr Leu Pro Lys Val 
125 
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All of the compositions and methods disclosed and claimed herein can be 
made and executed without undue experimentation in light of the present disclosure. While 
the compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to 
the composition, methods and in the steps or in the sequence of steps-of the method 
described herein without departing from the concept, spirit and scope of the invention. 
More specifically, it will be apparent that certain agents which are both chemically and 
physiologically related may be substituted for the agents described herein while the same or 
similar results would be achieved. All such similar substitutes and modifications apparent 
to those skilled in the art are deemed to be within the spirit, scope and concept of the 
invention as defined by the appended claims. Accordingly, the exclusive rights sought to be 
patented are as described in the claims. 
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